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Abstract 

■ We consider network design problems for information networks where routers can replicate 

O^l I data but cannot alter it. This functionality allows the network to eliminate data-redundancy in 

traffic, thereby saving on routing costs. We consider two problems within this framework and 
design approximation algorithms. 

The first problem we study is the traffic-redundancy aware network design (RAND) problem. 
On I We are given a weighted graph over a single server and many clients. The server owns a number 

of different data packets and each client desires a subset of the packets; the client demand sets 
form a laminar set system. Our goal is to connect every client to the source via a single path, 
. such that the collective cost of the resulting network is minimized. Here the transportation cost 

I over an edge is its weight times times the number of distinct packets that it carries. 

• i The second problem is a facility location problem that we call RAFL. Here the goal is to 

^ I find an assignment from clients to facilities such that the total cost of routing packets from the 

facilities to clients (along unshared paths), plus the total cost of "producing" one copy of each 
desired packet at each facility is minimized. 
^ i We present a constant factor approximation for the RAFL and an O(logP) approximation 

' for RAND, where P is the total number of distinct packets. We remark that P is always at 

•/^ . most the number of different demand sets desired or the number of clients, and is generally 

' much smaller. 
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1 Introduction 



We consider network design problems for information networks where edges can replicate data but 
cannot otherwise alter it. In this setting our goal is to exploit the redundancy in the given traffic 
matrix to save on routing costs. Formally, we are given a graph over a single server and many 
clients. The server has a universe of data packets available, and each client desires a subset of the 
packets. The goal is to determine a collection of paths, one from the source to each client, such 
that the total cost of routing is minimized. Here the cost of routing on an edge is proportional to 
the total size of the distinct packets that the edge carries. For example, if the edge belongs to two 
paths that each carry the same packet, then the edge only needs to route the packet once and not 
twice. We call this the traffic-redundancy aware network design problem, or RAND for short. 

RAND arises in networks that face a lot of data duplication. Consider for example a Netflix 
server serving movies to a large and varied clientele. Each client desires a certain subset of the 
movies. What routing paths should the server use to send the data to the clients so as to minimize 
its total bandwidth usage? If different movie streams involve disjoint sets of packets this boils 
down to setting up a single multicast tree, or solving the minimum Steiner tree problem, once per 
movie. However, the server may want to set up a single routing path for each client regardless 
of how many different movies the client desires. Moreover, clients desiring the same movie may 
desire it at different rates or qualities depending on their location or the device they are using. For 
example, a desktop user with a broadband connection may desire a high definition video, whereas 
a mobile phone user may be content with a much lower resolution. Then, the data sent to these 
clients is not identical but has some amount of overlap. The server can exploit this redundancy 
in traffic by using common paths for clients with similar demands, thereby saving on the actual 
amount of traffic routed. Redundancy in data can arise even across different movies when data 
streams are broken down into small enough packets. Anand et al. [1] show that this kind of traffic 
redundancy is highly prevalent in the Internet, and it can be eliminated across individual links by 
routers employing packet caches. 

Given the cost structure that this redundancy generates, it makes sense to try to route the 
demands of clients desiring similar sets along overlapping paths. An extreme example of the 
benefit of merging paths is when all clients desire the same set of packets. In this case, the problem 
becomes equivalent to finding the minimum-cost Steiner tree over the clients and the server. At 
the other extreme, if all the clients desire disjoint sets of packets, then merging does not help at 
all, and it is optimal to pick the shortest path from every client to the source. There is thus a 
trade-off between routing demands along shortest paths and trying to merge the paths of clients 
with similar demands. 

We also study a facility location version of the problem that we call traffic-redundancy aware 
facility location or RAFL. This problem is motivated by the prevalence of content distribution 
networks (CDNs) in the Internet. Netfiix servers, instead of connecting to clients directly, cache 
their data at multiple servers spread around the network that are hosted by a CDN such as Akamai; 
each client then connects to a CDN server individually to obtain its data. The savings in this case 
comes from assigning clients with similar demand sets to the same CDN server and sending a single 
copy of the multiply desired packets to the server. Formally we are given a network over potential 
facilities and clients. Each client, as before, desires a subset of the available packets. Our goal is 
to assign each client to a facility and route to each facility the union of the demand sets desired 
by clients assigned to it. The cost of such a solution is the sum of the cost of routing packets from 
facilities to clients (that are proportional to the size of the respective client's demand set), and the 
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cost of routing packets to facilities (that is proportional to the total size of the distinct packets 
being routed to the facility). The savings from redundancy in this case are realized in the facility 
opening costs where we assign multiple clients with similar demand sets to the same facility and 
pay for each of the common packets only once. 

RAND and RAFL model information networks as opposed to commodity networks in tradi- 
tional network design problems. They can therefore be considered as intermediate models between 
traditional network design and network coding. In the latter the information network is allowed to 
use coding to increase its capacity. In our context the network can eliminate redundant information 
but cannot otherwise alter the information. See |TT] and references therein for work on network 
coding. 

Our results and techniques. We study the RAND and RAFL under a laminar demands as- 
sumption. In particular, we assume that the sets of packets demanded by clients form a laminar 
set family. In other words, every pair of demanded packet sets is either disjoint or one is a subset 
of the other. Such a structure arises, for example, in the Netflix problem described above when 
layered coding is used; the packets for a lower encoding rate are a subset of the packets for a higher 
encoding rate. 

RAND generalizes the minimum Steiner tree problem and RAFL generalizes metric uncapaci- 
tated facility location (MUFL); therefore both problems are NP-hard. We study approximations. 

We develop a constant factor approximation for the RAFL based on the natural LP-relaxation 
of the problem. Our algorithm follows the filtering approach developed by Lin and Vitter [9j and 
later exploited by Shmoys et al. [U] in the context of MUFL. In the MUFL setting, in the filtered 
solution, each client t is associated with a set of facilities, say J-{t), such that the routing cost of 
any of those facilities is comparable to the routing cost that the client pays in the LP solution. 
It is then sufficient to open a set of facilities in such a way that each can be charged to a client 
with a distinct set of associated facilities. Clients t that are not charged (a.k.a. free riders) are 
rerouted to the facilities opened for other charged clients t' such that -F(i) and J-{t') overlap, and 
furthermore the routing cost for t' is smaller than that for t. Then, using the triangle inequality, the 
cost of rerouting can be bounded and this gives a constant factor approximation. In our setting, it 
is not sufficient to ensure that the routing cost of t' is smaller than that of t. In addition, we must 
ensure that the facility opened by t' can support the demand of t, otherwise every time we reroute 
a client we incur extra facility costs. Ensuring these two properties is tricky because clients with 
low routing costs may also have small demand sets; so essentially, these properties require us to 
consider clients according to two distinct and potentially conflicting orderings. 

In order to deal with this issue, our algorithm is run in two phases. In the first phase we consider 
clients in order of increasing routing costs, in order to determine which clients will pay for their 
facilities and which ones are free riders. In the second phase, we consider free riders in decreasing 
order of the sizes of their demand sets. Each free rider is associated with a set of paying clients 
that pay for the facility that this client opens. Every time a free rider opens a facility, we reroute to 
it all of the other clients whose paying neighbors overlap with those of this facility. In this manner 
we can ensure that whenever a client is rerouted, it is routed to a facility that already produces 
the packets it needs. Unfortunately, our algorithm charges each paying client multiple times for 
different facilities opened. We ensure that the costs that a client pays each time it is charged form 
a geometrically decreasing sequence, and can therefore be bounded in terms of the LP solution. 
Overall we obtain a 27-approximation. 
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For RAND an O(logn) randomized approximation can be obtained via tree embeddings 
because the problem is trivially solvable on trees; here n is the number of nodes in the network. 
We give a simple deterministic combinatorial 0(log P) approximation, where P is the number of 
distinct packets to be routed. Note that if two or more packets are essentially identical in that 
they are desired by exactly the same set of clients, then we can combine them into a single packet 
(albeit with a larger size). Then, under the laminar demands assumption, P is always at most the 
number of distinct demand sets or clients, which is at most n, the total number of nodes in the 
network. In fact in applications such as the Netflix multicast problem described above, we expect 
n to be much larger than P. 

Furthermore, our O(logP) approximation algorithm is a natural combinatorial algorithm that 
is simple and fast to implement. It is convenient to represent the laminar family of demand sets 
in the form of a tree where the demand set at any node is a proper subset of that at its parent 
and disjoint from those at its siblings. Our algorithm begins with some minor preprocessing of the 
demand tree to ensure that the tree has small (logarithmic) height. It then traverses the demand 
tree in a top-down fashion, and at each node of the tree constructs a (approximately optimal) 
Steiner tree over all the terminals with the corresponding demand set connecting them to the 
source. We show that the cost of the Steiner trees constructed at every level of the demand tree 
is bounded by a constant times the cost of the optimal solution, and therefore obtain an overall 
approximation factor proportional to the height of the tree. The height of the tree can however be 
much larger than logP because packets have different sizes. In order to prove an approximation 
factor of 0(logi-*) we need to do a more careful bounding of the total cost spent on long chains of 
degree 2 in the demand tree. Using the Steiner tree algorithm of Robins and Zelikovsky [12j as a 
subroutine we obtain a (6.2 log P)-approximation. 

Related work. RAND is closely related to single-source uniform buy-at-bulk network design 
(BaBND) [21 El [10] in that both problems involve a trade-off between picking short paths between 
the source and the clients and trying to merge different paths to avail of volume discounts. However 
the actual cost structure of the two problems is very different. In BaBND the cost on an edge is a 
concave function of the total load on the edge; In our setting the cost is a submodular function of 
the clients using that edge. Neither of the problems is a special case of the other. BaBND admits a 
constant approximation in the single-source version [5l[T6] and an O(logn) approximation is known 
for the general multi-source multi-sink problem [2]. 

One way of thinking about RAND is to break-up the problem and solution packet-wise: each 
packet p defines a subset of the terminals, say Tp, that desire that packet; the solution restricted 
to these terminals is essentially solving a Steiner tree problem over the set Tp U {s}. Our goal is to 
pick a single collection of paths such that the sum over packets of the costs of these Steiner trees 
is minimized. In this respect, the problem is related to variants of the Steiner tree problem where 
the set of terminals is not precisely known before hand. This includes the maybecast problem of 
Karger and Minkoff [8] for which a constant factor approximation is known, as well as the universal 
Steiner tree problem [7] for which a randomized logarithmic approximation can again be obtained 
through tree embeddings and this is the best possible [3]. 

Facility location has been extensively studied under various models. The models most closely 
related to RAFL are the service installation costs model of Shmoys et al. [13] and the heirarchical 
costs model of Svitkina and Tardos In the former, each client has a production cost associated 
with it; the cost of opening a facility is equal to a fixed cost associated with the facility plus the 
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production costs of all the clients assigned to that facility. One way of representing these costs is 
in the form of a two-level tree for each facility with the fixed cost for the facility at the root of 
the tree and the client-specific production costs at the next level nodes. The cost of assigning a 
set of clients to the facility is the total cost of the subtree formed by the unique paths connecting 
the client nodes to the root of the tree. Svitkina and Tardos generalize this cost model to a tree 
of arbitrary depth, although in their setting the trees for different facilities are identical. Both the 
works present constant factor approximations for the respective versions, based on a primal-dual 
approach and local search respectively. Our model is similar to these models in that our facility 
costs are also submodular in the set of clients connecting to a facility. In our setting, the costs 
can again be modeled by a tree in which each node is associated with one or more clients; the cost 
of a collection of clients is given by the total cost of the union of subtrees rooted at those clients 
(as opposed to the portion of the tree "above" the clients). Therefore, neither of the two settings 
generalize the other. Moreover, facility costs in our setting are different for different facilities, 
although they are related through a multiplier per facility. Finally, while in Shmoys et al. and 
Svitkina et al. routing costs are given merely by the metric over facilities and clients, in our setting 
they are given by the distances times the total demand routed. 



For modeling information flow, Hayrapetyan et al. [6] have studied a single-source network 
design problem with monotone submodular costs on edges, and a group facility location problem. 
The network design setting generalizes ours in that edge costs can be arbitrary submodular functions 
of the clients using the edges; they note that an O(logn) approximation can be achieved via tree 
embeddings. In contrast, we obtain an O(logP) approximation, where P, the number of distinct 
packets in the system, is always at most n and generally much smaller. In the group facility 
location problem, edge costs are identical to those in RAND, but neither of the problems subsumes 
the other: the former assumes there are multiple facilities (sources) with fixed opening costs, but 
also limits the number of distinct packets per client to at most one. 

2 Problem definition and preliminaries 

The traffic-redundancy aware network design (RAND) problem is defined as follows. We are given 
a graph G = {V, E) with weights Ce G on edges e G E, and a special node s called the source. In 
addition, we are given a set T of clients or terminals located at different nodes in the graph. The 
source carries a set 11 of packets with |n| = P. Each packet p G 11 is associated with a weight Wp] 
we assume that the weights are integral. Each terminal t € T desires some subset of the packets; 
this is called the terminal's demand and is denoted by D(t). We use the convention D(s) = 11. Also 
let w^S) = J2pes'^p denote the total weight of a set S of packets. 

We assume that the collection of demand sets T> = {D{t)}t£T forms a laminar family of sets. 
In particular, for any two terminals ti,t2 G T, D(ti) n 0(^2) 7^ implies that either D(ti) C D(t2) 
or D(t2) ^ D(ti). We use a tree r to represent the containment relationship between sets in the 
laminar family. The nodes of r are sets in T>. A demand set X is a parent of another set Y if 
Y C X and there is no set Z ^T> with Y d Z d X . At the root of the tree is the universe 11 of 
packets. For a demand set X in the tree r, we use Tx to denote the terminals t £T with T){t) = X. 

We denote an instance of RAND by the tuple {G, T, V, r). 

The solution to RAND is a collection of paths V = {Pt}t£T, with Pt connecting the terminal t 
to the source s. Given this solution, an edge e G £' carries the set S'p{e) = \Jt&T:PtBe^{t) of packets. 
The load on the edge is w-p{e) = w{Sp{e)). We drop the subscript V when it is clear from the 
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context. The cost of the solution V is cost(7') = Xlees '^eW^'p(e). Our goal is to find a solution of 
minimum cost. 

Let OPT = argmin£gj^gi)3ig -p cost('P) be an optimal solution. For a subset W of terminals, we use 
OPT(VF) to denote the restriction of OPT to W, that is, the collection of paths {Pt G OPTjievy- 

In the traffic-redundancy aware facility location problem (RAFL), we are given a set J- of 
facilities, and a graph over T U T with edge weights Cg. Let c{u,v) denote the shortest path 
distance between nodes u and v in the graph under the metric c. Furthermore, each facility f J- 
has a cost A/ associated with it. 

A solution to this problem is an assignment A from terminals to facilities. The assignment 
specifies the set of packets that a facility / needs to produce in order to serve all the terminals 
connected to it: SaI/) = ^te:T:A{t)=f^i't) ■ The load on the facility is WA{f) = w{SAif))- We 
drop the subscript A when it is clear from the context. The cost of the solution A is cost(^) = 
YlfeT^f^Aif) + Z^teT ^(*))- The first component of the cost is called the facility 

opening cost Cf{A)^ and the second the routing cost Cr[A) of the solution. Once again our goal is 
to find a solution of minimum cost. 

Both RAND and RAFL are NP-hard because they generalize Steiner tree and metric uncapac- 
itated facility location respectively. Our goal is to find approximation algorithms. 

3 A constant factor approximation for RAFL 

In this section we present a constant factor approximation for the RAFL. For ease of exposition 
we assume that all packets have unit weight, and write \S\ for the total size or weight of a set S of 
packets. This assumption is without loss of generality. We further assume without loss of generality 
that minjgjrA/ = 1. 

The following is a natural LP-relaxation of RAFL. Here Xtj is an indicator for whether terminal 
t is assigned to facility /, and yj^p denotes the extent to which facility / produces packet p. 



Our approach begins along the lines of the filtering approach developed by Lin and Vitter [9] 
and employs some of the rounding ideas of Shmoys et al. [14] developed for metric uncapacitated 
facility location. In particular, given an optimal solution to the LP, we preprocess the solution at a 
constant factor loss in performance such that each terminal is assigned to a non-zero extent only to 
facilities for which the terminal's routing cost is within a small constant factor of the corresponding 
average amount in the LP. At this point, each terminal can be assigned to any facility to which it 
is fractionally assigned by the filtered solution, at a low routing cost. The key part of the analysis 
is bounding the cost for producing packets at facilities. 

Let J-{t) denote the set of facilities to which t is assigned fractionally by the filtered solution. 
In Shmoys et al.'s setting, in order to bound the cost of opening facilities, it is sufficient to find a 
"paying" terminal for each open facility such that the sets J-{t) for paying terminals are mutually 
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disjoint. For each terminal t that is not paying, there exists at least one representative paying 
terminal t' such that J-{t) and J-{t') overlap; any such terminal t is assigned to the facility opened 
by its representative terminal t' at a slight increase in routing cost as long as the average routing 
cost of t' is no more than that of t. In order to accomplish this, we process terminals in order of 
increasing average routing cost. 

In our setting, this approach has a basic flaw. The terminal t' may have a much smaller demand 
set compared to the terminal t. Then, if we assign t to the facility opened by t', the facility needs 
to produce many more packets and its new larger opening cost can no longer be charged to t'. 
Unfortunately, it is not possible to ensure that t' has both a small average routing cost than t as 
well as a larger demand set. Instead, we divide the process of opening facilities into two parts. 
First we determine which terminals are paying and which ones are not by processing facilities in 
order of increasing average routing cost. Then we decide which facilities to open by processing the 
non-paying terminals in the order of decreasing demand set sizes. 

The algorithm is described formally below. We first introduce some notation. Let 
denote the optimal solution to the RAFL LP given above. Let C*{t) = ^f^j^x^j:c{t,f) and 
Cf(t) = J2f£T^t f^f denote the average routing and facility opening costs respectively under the 
solution {x*,y*) associated with a terminal t. The total routing and facility opening costs of the 
solution are given by C* = J2teT |D(t)|C'*(t) and = J^feT^^peJiy} p^f respectively. Likewise, 
for a feasible solution {x,y), we use d.'''y\t) and Cf''^\t) to denote the average routing and facility 
opening costs associated with a terminal t respectively. We drop the superscript {x, y) when it is 
clear from context. Also let Cr{x,y) and Cf{x,y) denote the total routing and facility opening 
costs of the solution {x,y). 

Our algorithm proceeds in three stages. The first is a filtering stage in which we convert the 
solution (x*,y*) into a fractional solution {x,y) which satisfies the following property: for all t, f 
with xtj > 0, c{t, /) < aC*{t). Here a is a parameter that we fix later. For a terminal t, we use 
J^{t) to denote all the facilities / that are fractionally assigned to t in {x, y), that is, have xtj > 0. 

In the second stage of the algorithm, we classify terminals into paying terminals and free 
terminals . Essentially, a terminal t becomes a free terminal if any of the facilities in J-{t) is 
already expected to produce a large fraction of fs demand. We record this facility as fs tem- 
porary assignment A(t). To this end, we say that a set of terminals W covers a terminal t if 
|D(t) \ (U('gvKD(t'))| < ^|D(t)|. If a terminal t is not covered at any of the facilities in J-{t), then it 
becomes a paying terminal and can potentially pay for any of the facilities in T{t). Pay(/) tracks 
the set of terminals paying for a facility /. 

Finally, in the third stage of the algorithm, we pick a permanent assignment from terminals to 
facilities by considering facilities in decreasing order of the sizes of their demand sets. As a first 
cut approach, suppose that we assign a free terminal t to the facility at which it is covered {A{t)), 
and pay for that facility using the paying terminals associated with it. To ensure that no paying 
terminal t' ends up paying for two or more opened facilities, we consider all the free terminals 
that this paying terminal covers and assign those also to the first facility that the paying terminal 
pays for. The order in which we assign free terminals to facilities ensures that in this last step 
we do not increase the facility opening cost of the solution. Here is the catch: which facility is 
actually opened is decided by the free terminal t that starts this process and may be one of the 
more expensive facilities in the paying terminal f"s set J-{t'). In this case, the paying terminal does 
not have enough charge in the LP solution to pay for this facility. In order to avoid this situation, 
we consider all of the facilities that are "close" to t or to other free terminals covered by the paying 
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Algorithm 1 Rounding algorithm for RAFL 

Given: LP solution Return: Assignment A from terminals to facilities 

Phase 1: Filtering 

1: for all t G T do 

2: Let C*{t) = </c(t, /) and C}{t) = E/e^ x^jXf 

3: For all / G J", if c{t, f) > aC*{t) set Xtj = else Xtj = x^ j. 

4: Renormalize xtj so that ^fXtj = 1. 

5: Let J'(i) = {/ : xtj > 0}, C/(i) = T^f^jrXtjXf, and Cr{t) = ^f^^xtjc{t,f). 
6: end for 

7: For all / G and peU, set = maxtgr:D(t)3p a;^ j. 

Phase 2: Classification of terminals into paying and free 
8: Initialize paying and free terminal sets: T*' = and = 0. 

9: For alH G r initialize temporary assignment A{t) = 0, permanent assignment A{t) = 0, and 

cover Cov(t) = 0. 

10: For all / G initialize paying terminal set Pay(/) = and final paying set FPay(/) = 0. 
11: while T\{TP\J T^) 7^ do 

12: Let t = argmin^gj^y^-ppy-p/) C*{j). {Select terminal with least connection cost} 
13: if there exists / G T{t) such that Pay(/) covers t then 
14: T-^ = T'^ U {t} {t is a free terminal} 
15: A{t) = f 

16: Assign covering set for t: Cov(i) = {j G Pay(/) | D(j) n D(t) 7^ 0} 
17: else 

18: = u |i} |i is a paying terminal} 

19: For all / G update Pay(/) = Pay(/) U {t} 

20: end if 

21: end while 

Phase 3: Opening facilities 

22: For all t eTP assign level i{t) = \\0g2 C/(t)]; for all t e assign i{t) = minj£Cov(t) ^(j)- 

23: Initiahze t/ = g T-^ | = d} and for all t G t/ set rd(t) = {i' G | Cov(t)nCov(t') 7^ 0}. 

24: for d = to maxj^TP ^(j) do 

25: Initialize W = {Repeat till all terminals in have been assigned a permanent facility}. 
26: while 7^ do 

27: Let t G argmaXjg^^ |D(j)| {Select any terminal with the largest demand set}. 

28: Let t G argmin^grd(t) 

29: Let facility G argminygu,.^cov{t)^{i) ^Z" 

30: Let A{t) = 4>{t}{We say that t opens the facility (p{t).} 

31: For all t' G Fdit), assign A{t') = A{t). Update W = W\ {Td{t) + {t}). 

32: Assign final paying set for facility A{t): FFay{A{t)) = Cov{t). 

33: end while 

34: end for 

35: for all t e do 

36: Let A{t) G argminjgj-(-j) A/. 

37: end for 
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terminals that cover t. Of these we open the faciUty with minimum cost and pay for it using the 
paying terminals associated with /. 

While in our algorithm a paying terminal can end up paying for multiple opened facilities, in 
Lemma H] below we argue that the costs of those facilities decrease geometrically and so the sum 
can be bounded. 

We now formalize this argument. We begin by showing that the fractional solution (x, y) is not 
too expensive. 

Lemma 1. The solution {x,y) is feasible for the RAFL LP. Moreover Cf{x,y) < -^^fCj. 

Proof. {x,y) is feasible by construction. Note that for all t G T, X]/^/t = 1) otherwise the cost of 
the solution can be improved. Therefore, by Markov's inequality, f-c{ f t)>aC* ^*ft — ^/o^- Then, 
in the renormalization step (Step [4]) we set xtj to be no more than a/ {a — l)x* j. 

For all / € F and p € 11 we set yf^p to be max^gj-.^^^^^p xj j. Hence yf^p is no more than 
max^£'j^.£)(^)3p -^^xl ^, which is no more than a/ {a — ^)y*fp- This in turn implies the second part 
of the claim. □ 

Next we bound the routing cost of the assignment A. 
Lemma 2. For all i G T, c{i,A{i)) < 9aC*{{}. 

Proof. Let i be a paying terminal that is assigned a facility / in Step [36] of the algorithm. Then, 
/ € T{i) and therefore, by the definition of x and J-'(i), c{i, f) < aC*{i). 

Now, let i be a free terminal which is assigned a facility / in Step [30] or Step [31] of the algorithm. 
Say level l{i) = d and let t be the terminal that opened /. Then there are two possibilities: either 
i = t or i G T(i{t). In the former case, A{t) G T{i) and so c{i,A{t)) < aC*{i). 

Next we show that if terminal t opens a facility (in Step [30]l then for all t' € ^d{t) we have 
c{t',A{t)) < 3aC*(t'). Note that Cov(f ) n Cov(t) / and let j be a paying terminal in Cov(t') n 
Cov(t). We have A{t) G and A(t') € Since terminal j was selected in the second phase 

before t' we have C*{j) < C*.{t'). By triangle inequality we get that c{t',A{t)) < c{t',A{t')) + 
c{A{t'),j) + c{j,A{t)) < aC;{t') + 2aC;{j) < ?>aC;{t'). 

In Step [30] the opened facility, say /, is selected to be (/>(t) for the terminal t G ^d{t) with 
minimum C* value. Since / G ^{j') for some j' € Cov(t) we have c{f,t) < 3aC*(t). Also t is 
contained in Td{t), which implies that c{t,A{t)) < 3aC*{t). 

Again using triangle inequality we get the desired result: c{i, f) < c{i, A{t))+c{A{t),t)+c{t, f) < 
3aC*{i) + 6aC*{t) < 9aC*{i). Here the last inequality follows from the definition of t. □ 

Finally we account for the facility opening cost of the solution. Recall that 5'a(/) denotes the 
set of packets produced at / under the assignment A. We note that a facility / may be "opened" 
multiple times by different free terminals in Step [30] of the algorithm. In this case, we treat each 
subsequent opening as opening a new copy of / and designate a distinct set of terminals, FPay(/), 
to pay for all of the packets to be produced at the new copy freshly (even though some of them 
may already be assigned to the facility). 

Henceforth, for ease of exposition we will assume that each facility is opened at most once. 

The following lemma notes that |S'a(/)| can be bounded in terms of the demand sets of the 
terminals finally paying for this facility. 

Lemma 3. Let f be a facility opened by a free terminal t. Then S^if) = D(i) U UjgCov(t)D(j). 
Furthermore, \SA{f)\ < 2[ UjgCov(t) D(j)|. 
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Proof. Let t & and consider a terminal t' G Td{t). We claim that D(t') C D(t) U UjgCov(t)D(i), 
and this implies the first part of the lemma. Let j € Cov(t) H Cov(t'). Then, by the definition of a 
covering set, D(t') nD(j) 7^ 0. By laminarity, either D(j) D D(t') in which case our claim holds, or 
D(j) C D(t')- In the latter case, D(f)nD(j) 7^ implies D(i)nD(t') ^ 0. Once again by laminarity, 
either D(t') C D(t), or D(t') D E)(t). The latter case cannot hold because t is considered before t' 
in phase 3 and therefore |D(t')| < |D(t)|. In the former case our claim holds. 

The second part of the lemma follows from the definition of covering. □ 

Using this lemma we can bound the facility opening cost of the assignment A in terms of the 
average facility opening costs Cf{t) for the paying terminals t G in the fractional solution {x,y). 

Lemma 4. Zf I^a(/)|A/ < d^teTv \m\Cfit). 

Proof. First we bound the opening cost of facilities that were opened by free terminals. Lemma [3] 
implies that < 2| U,epp,y(^) D(j)| < 2E,GFPay(/) For facility / write i{f) = d iff 

the terminal opening it has level d. Note that for facility / with £{f) = d we have Xj < 2'^. 

Fix a terminal i G T^, and write Cov~"^(i) = {j G T-'^ | i G Cov(j)}. Say £{i) = I. Then 
Cfii) G (2^-1,2^]. By definition, for all 3 G Cov~^(i) we have ^(j) < i. 

We claim that i pays for at most one facility at level d for d < and does not pay for any 
facilities at level d > 1. We prove the first part by contradiction. Say there exits f ^ f such that 
£(/) = i{f') = d and i G FPay(/) n FPay(/'). Write t as the terminal that opens / and t' as the 
terminal that opens /'. Both t and t' are in Cov^^(z) and have level equal to d. Without loss of 
generality assume t was processed before t' by the algorithm. Then t' G ^dit) and we would have 
A{t') = A(t), contradicting the assumption that they are assigned to different facilities. 

For the second part of the claim, suppose that i G FPay(/) for some facility /. Then / is 
opened by a terminal t G Cov^^(i). As stated above, the level of such a terminal t must be no 
more than the level of i, hence i{f) < 

For a level d facility / we have Xj < 2'^ and hence X]/ FPay(/)9j ^/ — Sd=i — ^^/(i). We 
therefore get the following chain of inequalities. 

j2>^f\SA{f)\<2^Xf 

/ / jeFPay(/) 

= 2^|D(j)| E 

jeTP f: FPay(/)9i 

<8j2Cf{j) |D(j)| 

Here the first inequality follows from Lemma [3] and the second inequality follows from the bound 

:FPay(/)9 

To account for facilities that were opened by paying terminals in Step [36l of the algorithm we 
note that for any such facility h, the set SA{h) = D(t) where t is the paying terminal that opened 
h. Moreover Xh < XtjXj and hence the opening cost incurred by the algorithm is no more than 
the fractional value, A/i|D(t)| < Cf{t) |D(t)|. Hence the total facility opening cost incurred by the 
algorithm is no more than 9j2teTp D 

To complete the argument, we relate the costs Cf{t) to the total cost Cf{x,y). 
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Lemma 5. J2teTP < 2C/(x,y). 

Proof. When a terminal t is added to T'^ (Step [18] of the algorithm) it is not covered at any of the 
facilities in T{t). For / € J-" let L{t, f) = D(t) \U(/gPay(/)D(i') denote the set of packets in D(t) that 
is not covered at / at the time that t is considered. Here, Pay(/) denotes the set of terminals that 
is paying for / at the time that t is considered. Note that \L{t,f)\ > l/2|D(t)| for / G J^{t). For 
any facility / the sets L{t, f) are disjoint and partition the support of yf^p and hence the following 
chain of inequalities hold: 

J2yf,p > Yl yf'P 

P teTPp^Litj) 

> E E ^*-/ 

= |L(t,/)|xij (1) 

We can now derive the desired bound: 

= E ^/ E yf'P 
f p 

>E^/Ei^(*'/)i^*j 

/ t£TP 

^EE^D(t)i A/x,; 
= E^i^(*)iE^/^*./ 

teTP f 

= ^E \m\cf^'\t) 

teTp 

Here the third step follows from inequality ([T|) and the fourth step holds because for any t € 
and / € J^, either xtj = or / G T{t) and the size of the set L{t, f) is at least half its demand: 
>l/2|D(t)|.' □ 

Putting together the above lemmas we obtain the following theorem: 

Theorem 6. AlgorithmUl gives a 27 -approximation to the RAFL. 

Proof. Lemma [2] implies that for any terminal the routing cost is no more than 9a times the 
optimal. Also from Lemma H] and Lemma [5] we get that the total facility opening cost is no more 
than 18 times the total facility opening cost of the filtered solution Cf{x,y). Finally from Lemma 
[1] we have Cf{x,y) < a/{a — 1) Cp so the facility opening cost of the generated solution is no 
more than 18Q/(a — 1) times the optimal. Hence the algorithm achieves an approximation factor 
of max{9a, ^^}, which is minimized at a = 3 to give us a 27-approximation. □ 
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4 An O(logP) approximation for RAND 



In this section we develop an 0(log P)-approximation algorithm for the RAND where P = \H\. The 
basic observation that our algorithm hinges on is that if every pair of demand sets in V is either 
identical or disjoint, that is, the tree r is a two-level tree, then the problem becomes easy and can 
be approximated to within a small constant factor. In particular, then the problem becomes one 
of finding optimal Steiner trees connecting the terminals in Tx to s for every set X ^T). The cost 
of these Steiner trees is purely additive because the different sets in T> are disjoint. 

In particular, this implies that for any collection X containing disjoint sets of packets, we can 
construct a partial solution over terminals in Ux^xTx at a cost of constant times the cost of 
the optimal solution. This immediately suggests an algorithm with approximation ratio a constant 
times the depth of the tree r: define the level of a node in r as its distance from the root 11; for every 
level k, consider the collection of sets at level k and construct a constant factor approximation 
over terminals in Ux&x^Tx- 

In order to obtain a small approximation ratio through this approach, our algorithm first per- 
forms a preprocessing of the demand tree r to ensure, at small cost, that for any pair of demand 
sets in the tree where one is a parent of the other, the total weight of the parent is at least twice as 
large as the total weight of the child. Since the weight of every packet is integral, this implies that 
the depth of the preprocessed tree is at most logtt;(n), and gives us an 0(logtt;(n)) approximation. 

To obtain an O(logP) approximation, we need to do a more clever analysis. As discussed in the 
introduction, P denotes the number of effectively distinct packets in the instance. In particular, 
we can assume without loss of generality that P is equal to the number of nodes in the tree r. 
Our next key observation is that we can collectively bound the total cost of Steiner trees for nodes 
in a long root to leaf chain of nodes by a constant times the cost of the optimal solution (rather 
than by the length of the chain times the optimal cost). Here we crucially use the fact that each 
subsequent node in the chain has a weight at most half that of the preceding node, a fact that 
is ensured through our preprocessing step. Given this, we break up the demand tree into logP 
collections of chains, each of which corresponds to disjoint packet sets. The total cost of Steiner 
trees over each collection of chains can then be bounded to within a constant factor of the optimal 
cost, and we get an overall 0(logi-*) approximation. 

We now present our algorithm and analysis formally. 

Preprocessing the tree r. Our preprocessing phase is described in Algorithm [2] below. The 
preprocessing phase makes two changes to the given instance. First, it changes the demand sets of 
some terminals to supersets of their original demands. Second, after these changes to demand sets, 
if there are nodes in r that do not have any terminals associated with them, it merges these nodes 
with their parents. 

We obtain the following lemma. 

Lemma 7. Consider an instance {G, T, T>, r) of RAND. Then Algorithm \M returns an instance 
{GjTjV ,t') with the following properties: 

1. For every t G T, D'(t) D D(t) and w{T)'{t)) < 2w{T>{t)). 

2. V is a laminar family. 

3. For every pair of sets X,Y £D' such that X is a child ofY in t' , wiY) > 2w{X). 
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Algorithm 2 Preprocessing algorithm 

Given: Instance {G,T,T),t)] Return: New instance {G,T,T>\t') 
1: Perform depth first search over the tree r. 
2: for all nodes X ^T> encountered during DFS do 
3: Let Y be the parent of X in r 
4: if w{X) > ^w{Y) then 
5: For ah t € Tx, set D'(t) = Y. 

6; Merge X with Y. That is, set Ty = Ty U Tx, remove X from r, and reattach the children 

of X in r as the children of Y in the modified tree. 
7: end if 
8: end for 



4-. The cost of the optimal solution over {G,T,T>' ,t') is at most twice that of the optimal solution 
over (G, T, T>, r). 

5. Any feasible solution to {G, T, T>' ,t') is also feasible for {G, T, T>, r). 

Proof. We first remark that the demand of any terminal t is modified at most once, when its original 
node D(i) is encountered in the DFS over r; after this, the new node D'(t) is never processed again 
by DFS. Then, it is easy to see that the first property holds. The second property holds because r' 
is a tree over sets in T>' and therefore any two sets in T>' are either disjoint or one is an ancestor of 
another. The third property holds by construction: when the node X is encountered by the DFS, if 
the node survives the preprocessing, then it holds that w{X) < ^w{Y). The fifth property follows 
immediately from the first. 

To prove the fourth property, we note that for any two terminals ti and t2, D(ti) C D(t2) 
implies D'(ti) C D'(t2). Now consider any optimal solution V = {Pt}t<=:T to {G,T,T>,t). We 
will show that the cost of the solution V for the new instance {G,T,T>' ,t') is no more than twice 
its cost for {G,T,T>,t). For every edge e € E consider the set of terminals, say T(e), that use 
e: T(e) = {t : Pf 3 e}. Let T'{e) be the subset of T(e) of terminals whose demand is not 
contained inside the demand of any other terminals in T{e); that is T'(e) denotes the terminals 
with the maximal demand sets. Then, in the instance {G,T,V,t), w{Sp{e)) = ^^^^^'(e) ^(■^(*))• 
Moreover by our observation above, in the new instance {G,T,T>' ,t'), terminals in T'(e) are still 
the maximal demand terminals and w{S-p{e)) = "^t^x'ie) "^(^'i^))- The claim now follows from the 
first property. □ 

Main algorithm. We now proceed to describe the main algorithm. Henceforth we will assume 
that the given instance of RAND satisfies the properties listed in Lemma O in particular, property 
3. 

We first define some notation. For a demand set Y, let S*{Y) denote the optimal Steiner tree 
over Ty U {s}. The cost of this Steiner tree is cost(5*(y)) = w{Y) J2ees*(Y) '^e- Analogously we 
define the cost of an arbitrary Steiner tree S{Y) over Ty U {s} as cost(5(y)). 

In our analysis, we sometimes need to consider sets of nodes in r and bound their cost collec- 
tively. To this end, we define a chain y to be a set Y = {10;^!) • • • ,Yk} where for every i < k, 
Yi is a parent of y^+i. Recall that this implies Yq D Yi D ■ ■ ■ D Yf^ and w{Yi) > 2w{Yi^i) for all 
i < k. We call the node Yq the start of the chain Y. We say that two chains Yi and Y2 are disjoint 

if (Uy^y-Y) n (Uy^y-Y) = 0. 
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Algorithm 3 A logarithmic approximation for RAND 

Given: Instance (G, T, D, r) satisfying the properties in Lemma[71 Return: Collection of paths from 
each terminal to the source s. 

1: Perform depth first search over the tree r. 

2: for all nodes A G P encountered during DFS do 

3; Construct an approximately optimal Steiner tree 5(A) in G over Tx U {s}. 
4: For every t G Tx, return the unique path in 5(A) from t to s. 
5: end for 



The following is the main lemma of this section and allows us to bound the cost of large 
collections of nodes in r. 

Lemma 8. Let y = {Yi,Y2, ■ ' ' } be a collection of mutually disjoint chains. That is, for any 
Yi,Yj e y, Yi and Yj are disjoint. Then ^y^j; ^ygy cost(5*(y)) is at most 2cost(OPT). 

Proof. For Y G y, let OPT(y) denote OPT(Uy gyTy), the restriction of the optimal solution 
to the terminals associated with sets in Y. Note that because the chains in y are disjoint, 
^yg-j^ cost(OPT(y)) < cost(OPT). Therefore, for the rest of the proof we will argue that for 
any Y ey, J2YeY^^^^i^*(^)) ^ 2cost(OPT(F)), and this will imply the lemma. 

To prove the claim, fix a chain Y G y, and let Y = {Yq,Yi, • • • , Yf^} where Yq D Yi D ■ ■ ■ D Y^, 
and w{Yi) > 2tt;(yj+i) for all i < k. Recall that OPT(y) contains a path for every terminal 
in Uy^y'^y- denote the collection of paths for terminals in Ty. We say that e G Vy 

if e G Ut^TyPt- Let S(Y) denote the Steiner tree over Ty U {s} defined by Vy and note that 
cost(5*(y)) < cost(5(y)) = it;(y)EegPy ce. Then, 

5^COSt(5*(y))< J]ce J2 ^(^) 

On the other hand, because of the containment structure of sets in Y, 

cost(OPT(y)) = y Ce max w{Y) 

To conclude the proof we claim that for every edge e, Yly^Y VY^e w{Y) < 2 maxy^y.^^^^ w{Y). 
But this is easy to see because weights of sets y G y are geometrically decreasing by a factor of at 
least two. □ 

Next we show that the tree r can be decomposed into at most logP different collections of 
mutually disjoint chains. This along with Lemma[8]will allow us to prove our desired approximation. 

Lemma 9. Any demand tree r can be decomposed into at most logP different collections of mutu- 
ally disjoint chains 3^1,3^2, ■■ ■ )3^fc; such that each node in r belongs to exactly one collection. Here 
P is the number of nodes in r. 

Proof. We decompose the tree by finding a long chain, removing it from the tree, and then recursing 
on the remaining subtrees. Given a tree r with root Yq, we start at Yq and follow a path down to 
a leaf. Let m denote the number of nodes in r. At any node, we consider the sizes of the subtrees 
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rooted at the node in terms of the number of nodes in the tree; the path then moves to the child 
corresponding to the largest subtree. Note that all of the other remaining subtrees rooted at the 
node have at most m/2 nodes. 

Let Y = {Yq, Yi, - ■ ■ , Yfc} be the path obtained. This collection of nodes is a chain by definition. 
Consider removing the nodes in Y from r. We then note the following properties: (1) Every 
remaining connected component of the tree is of size at most m/2; (2) Let ti and T2 denote any 
two connected components. Then (Uxen-^) ^ (UxeT2-'^) = 0, that is, the connected components 
are mutually disjoint. 

The second property can be proved by contradiction. If two of the components are not disjoint, 
then by laminarity, they contain nodes Xi and X2 respectively such that Xi is an ancestor of X2 
in T. Since Xi and X2 belong to different connected components, there must be a node on the 
path between them in r that is in the chain Y . Then, all ancestors of this node are in Y including 
Xi which contradicts the fact that Xi belongs to a connected component left after removing the 
chain. 

We output {y} as the first collection of mutually disjoint chains. (Note that this first collection 
has only one chain in it.) Now let ri,T2,--- be the connected components left over in r after 
removing the nodes in Y . We recursively find collections of mutually disjoint chains in each of the 
components. Call these J^a, 3^*25 ■ ■ ■ etc. for the ith connected component. Then, we output the 
collections UjJ^jj for each j. Since the connected components are mutually disjoint, the collections 
we output are also mutually disjoint. Furthermore, since the sizes of the connected components 
decrease by a factor of 2 each time we go down a level of recursion, it is easy to argue that the 
number of collections we output are bounded by log-P where P is the number of nodes in the tree 
r that we started out with. □ 

We now present the main theorem for this section: 

Theorem 10. Let {G,T,T>,t) he an instance of RAND that satisfies the conditions in Lemma^ 
Then Algorithmic obtains a {2a log P)- approximation for the RAND over this instance where P is 
the number of effectively distinct packets in the instance, and a is the approximation factor of the 
Steiner tree algorithm used in Step of the algorithm. 

Proof. For every set X G D, let S{X) denote the Steiner tree built by our algorithm over Tx U {s}. 
Then we have cost(5(X)) < acost{S* {X)). We first use Lemma [9] to decompose the tree r into at 
most logP collections of mutually disjoint chains. Call these collections 3^i,3^2, • ' ' 
The total cost of our solution can now be written as 

Y;^cost{SiY)) = Y,Yl Ecost(5(y)) 

Y£V i<k Yeyi Y& 

^"E_E Ecost(5*(y)) 

i<k ygy. Y& 

< aE2cost(OPT) 

i<k 

= 2ak cost(OPT) 

Here the second inequality follows by applying Lemma [HI The theorem now follows by noting that 
k < log P. □ 
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Combining Theorem 1101 with Lemma [7] we get the following result. 

Corollary 11. Algorithms\^and\^tog ether obtain a (ia log P)- approximation for the RAND where 
a is the approximation factor of the Steiner tree algorithm used in the algorithm. 

We conclude this section by noting that Algorithm [3] can be implemented in a simple combina- 
torial fashion in O(n^) time as a generalization of Prim's algorithm for the minimum spanning tree 
problem as follows. This version of the algorithm obtains an (8 log P)-approximation. 

1. Let anc(t) denote the set of ancestors of t (those with demands that are strict supersets of 
D(t)), and peer(t) denote the set of its peers (those with demands identical to that of t). 

2. At any step call a terminal t eligible if all of its ancestors are already connected to the root. 

3. Initialize W = {s}. 

4. Let A{t) denote the distance in G from t to its closest node in n (anc(t) U peer(t)). 

5. While W ^ T, pick the eligible terminal with the smallest A(t) and connect it to its closest 
node inW n (anc(t) U peer(t)). Update W = WLl{t} and update A. 
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